VIDAS: Video Assisted Audio Coding and Representation

ثبت نشده
چکیده

Goals The basic objective of VIDAS was that of approaching the problem of facial animation from a combined audio-video point of view, for both the analysis and the synthesis. The main goal was the consolidation of advanced technologies for synthetic/natural representation and coding of facial sequences. In particular, a generic face-to-face communication has been modelled as a multimodal information source characterised in audio by a human speaker’s voice and in video by the same speaker’s face. In the vast majority of cases, in fact, an interpersonal communication consists exactly of these two strongly correlated items: a speaking face together with its synchronous speech. From the technical point, a major goal addressed by the project was that of implementing a software prototype for synthetic facial animation, compliant with the new MPEG-4 standard, capable of reproducing the speaker’s face through a 3D deformable model, suited to be calibrated through standard FDP (Face Definition Parameters) and to be animated through standard FAP (Face Animation Parameters). The mandate of VIDAS included a deep and responsible commitment in the MPEG-4 standardisation process with special reference to SNHC (Synthetic/Natural Hybrid Coding) and FBA (Face and Body Animation). Part of VIDAS responsibilities have also concerned the dissemination of this kind of information to the scientific European community through a variety of initiatives like the periodic ACTS Concertation Meetings, ECMAST and IST conferences and “ad hoc” workshops like those organised by VIDAS itself in Rhodes (IWSNHC3DI’97) and in Santorini (IWSNHC3DI’99). Final goals of VIDAS have been the evaluation of the subjective impact produced by the synthetic facial animation on normal hearing and on hearing-impaired subjects, and the feasibility study for porting this MPEG-4 technology on real-time platforms.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Recognition of Visual Events using Spatio-Temporal Information of the Video Signal

Recognition of visual events as a video analysis task has become popular in machine learning community. While the traditional approaches for detection of video events have been used for a long time, the recently evolved deep learning based methods have revolutionized this area. They have enabled event recognition systems to achieve detection rates which were not reachable by traditional approac...

متن کامل

Application of H.263+ Video Coding Modes in Lossy Packet Network Environments

The quality of real-time audio and video information transmitted via today’s Internet suffers severely from often significant packet losses. While this problem is well understood and solved for existing audio coding schemes, support from the video coding standards themselves is required for video streams. This paper presents the newly introduced error resilience mechanisms built into the second...

متن کامل

Analysis of Packet Loss and Latency Control for Robust IPTV over Mobile WiMAX and LTE Assessment (RESEARCH NOTE)

Abstract   The streamed audio video (AV) content for IPTV across mobile WiMAX channel, the different schemes were discussed to reduce the noise, packet loss and latency. The objective of this paper is to verify the effectiveness of forward error correction (FEC) techniques and to suggest the techniques for robustness problems and to analysis the issues either due to AV coding encoding or due to...

متن کامل

Fast Intra Mode Decision for Depth Map coding in 3D-HEVC Standard

three dimensional- high efficiency video coding (3D-HEVC) is the expanded version of the latest video compression standard, namely high efficiency video coding (HEVC), which is used to compress 3D videos. 3D videos include texture video and depth map. Since the statistical characteristics of depth maps are different from those of texture videos, new tools have been added to the HEVC standard fo...

متن کامل

Data Format and Coding for Free Viewpoint Video

In this paper we will discuss data representation formats and coding aspects of a new 3D functionality called Free Viewpoint Video (FVV). Similar to Computer Graphics (CG) or VRML applications, FVV allows for navigating with a virtual camera in 3D scenes. However, in contrast to graphical scenes in CG and VRML, FVV is based on natural scenes consisting of 3D video objects that provide a higher ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999